Reinforcement Learning in Evolutionary Games∗
نویسندگان
چکیده
We study an evolutionary model in which strategy revision protocols are based on agent specific characteristics rather than wider social characteristics. We assume that agents are primed to play a mixed strategy, with the weights on each pure strategy modifiable on the basis of experience. At any time, the distribution of mixed strategies over agents in a large population is described by a probability measure on the space of mixed strategies. In each round, a pair of randomly chosen agents play a symmetric game, after which they update their mixed strategies using a reinforcement learning rule based on payoff information. The resulting change in the distribution over mixed strategies is described by a non-linear continuity equation — in its simplest form a first order partial differential equation associated with the classical replicator dynamics. We provide a general solution to this equation in terms of solutions to an associated finite-dimensional dynamical system. We use these results to study in detail the evolution of mixed strategies in various classes of symmetric games, and in a simple model of price dispersion. A key finding is that, when agents carry mixed strategies, distributional considerations cannot be subsumed under a classical approach such as the deterministic replicator dynamics.
منابع مشابه
Comparing Reinforcement Learning and Evolutionary Based Adaptation in Population Games
In evolutionary game theory, the main interest is normally on the investigation of how the distribution of strategies changes along time and whether an stable strategy arises. In this paper we compare the dynamics of two games in which three populations of agents interact: a three-player version of matching pennies and a game with several Nash equilibria. We do this comparison by three methods:...
متن کاملPh D in Computer Games and Artificial Intelligence
The field of study of the PhD position offered is Computer Games (CG) and Artificial Intelligence (AI) in general. More precisely the games studied are zero-sum two-player complete information games with a high complexity such as Go, Amazons or Shogi. In addition to Tree Search (TS) and knowledge-based techniques, the technique newly used in the computer Go community is Monte-Carlo (MC) [5]. Th...
متن کاملMulticast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach
Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...
متن کاملAdaptive agents on evolving networks
In this work we study the learning dynamics for agents playing games on networks. We propose a model of network formation in repeated games where players strategically adopt actions and connections simultaneously using a reinforcement learning scheme which is called Boltzmann-Q-learning. This adaptation scheme in the continuous time limit has a proven relation to the evolutionary game theory th...
متن کاملCreating Intelligent Agents in Games
Game playing has long been a central topic in artificial intelligence. Whereas early research focused on utilizing search and logic in board games, machine learning in video games is driving much of the recent research. In video games, intelligent behavior can be naturally captured through interaction with the environment, and biologically inspired techniques such as evolutionary computation, n...
متن کاملReinforcement Learning in Large Population Models: A Continuity Equation Approach∗
We study an evolutionary model in which strategy revision protocols are based on agent specific characteristics rather than wider social characteristics. We assume that agents are primed to play mixed strategies. At any time, the distribution of mixed strategies over agents in a population is described by a probability measure. In each round, a pair of randomly chosen agents play a game, after ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010